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Abstract 

In this paper, we study the state complexities of union and intersection 
combined with star and reversal, respectively. We obtain the state com- 
plexities of these combined operations on regular languages and show that 
they are less than the mathematical composition of the state complexities 
of their individual participating operations. 



1 Introduction 

State complexity is one of the fundamental topics in automata theory. It is im- 
portant from both theoretical aspect and implications in automata applications, 
because the state complexity of an operation gives an upper bound of both time 
and space complexity of the operation. For example, programmers should know 
the largest possible number of states that would be generated before they per- 
form an operation in an application, since they need to allocate enough space 
for the computation and make an estimate of the time it takes. 

The research on state complexity can be recalled to 1950's [20]. However, 
most results on state complexity came out after 1990 [31 HI [5J O [TTJ [T31 [HI 
[TSl [ini HH [ISl [H] • Their research focused on individual operations, e.g. union, 
intersection, star, catenation, reversal, etc, until A. Salomaa, K. Salomaa and S. 
Yu initiated the study of state complexities of combined operations in 2007 [H] . 
In the following three years, many papers were published on this topic [1] [2| [3 

[Hi[i[ini[isi[n]- 

People are interested in state complexities of combined operations not only 
because it is a relatively new research direction but also because its importance 
in practice. For example, several operations are often applied in a certain or- 
der on languages in searching and language processing. If we simply use the 
mathematical composition of the state complexities of individual participating 
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operations, we may get a very huge value which is far greater than the exact 
state complexity of the combined operation, because the resulting languages of 
the worst case of one operation may not be among the worst case input lan- 
guages of the next operation [51 (THl HZl [H] . Although computer technology is 
developing fast, time and space should still be used efficiently. Thus, state com- 
plexities of combined operations are at least as important as those of individual 
operations. 

In [5T], two combined operations were investigated: {L{M) U L{N))* and 
{L{M) n L{N))* , where M and N are m-state and n-state DFAs, respectively. 
In [17j . Boolean operations combined with reversal were studied, including: 
(L(M) UL{N))^ and {L{M) r\L{N))^. One natural question is what are the 
state complexities of these combined operations if we exchanged the orders of the 
composed individual operations. For example, we perform star or reversal first 
and then perform union or intersection. Thus, in this paper, we investigate four 
particular combined operations: L{M)*UL{N), L{M)*nL{N), L{M)^UL{N) 
and L(M)^nL(7V). 

It has been shown in [53] that, (1) the state complexities of the union and 
intersection of an m-state DFA language and an n-state DFA language are 
both mn, (2) the state complexity of star of a fc-state DFA language is j2'', 
and (3), the state complexity of reversal of an Z-state DFA language is 2'. In 
this paper, we obtain the state complexities of L{M)* U L{N), L{M)* n L(N), 
L{M)^ U L{N) and L{M)^ n L{N) and show that they are all less than the 
mathematical compositions of individual state complexities for m,n>2. 

We prove that the state complexity of L{M)* LIL{N) is |2™ • n — n -|- 1 for m, 
n > 2 which is much less than the known state complexity of {L{M) U L{N))* 
([H]). We obtain that the state complexity of L(M)*nL(7V) is also \2'^-n-n+l 
for m, n > 2 whereas the state complexity of {L{M) n L{N))* has been proved 
to be 1 2™", the mathematical compositions of individual state complexities 
([21]). For L(M)^ U L{N) and L{M)^ n L{N), we prove both of their state 
complexities to be 2™ • n — n + 1 for m, n > 2 while the state complexities of 
(L(M) U L{N))^ and (L(M) n L{N))^ are both 2™+" - 2™ - 2" -h 2 ([l^). 

In the next section, we introduce the basic notations and definitions used in 
this paper. In Sections [21 SI E] and [H we investigate the state complexities of 
L(M)*UL(iV), L{M)*nL{N), L{M)^UL{N) and L{M)^nL{N), respectively. 
In Section [71 we conclude the paper . 

2 Preliminaries 

An alphabet S is a finite set of letters. A word w G S* is a sequence of letters 
in S, and the empty word, denoted by e, is the word of length 0. 

A deterministic finite automaton (DFA) is usually denoted by a 5-tuple A = 
{Q, E, S, s, F), where Q is the finite and nonempty set of states, E is the finite 
and nonempty set of input symbols, (5 : Q x E ^ Q is the state transition 
function, s G Q is the initial state, and F C Q is the set of final states. A DFA 
is said to be complete if 5 is a total function. Complete DFAs are the basic 
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model for considering state complexity. Without specific mentioning, all DFAs 
are assumed to be complete in this paper. We extend (5 to Q x E* — Q in the 
usual way. Then this automaton accepts a word w G S* if S{s, w)nF $. Two 
states in a DFA are said to be equivalent if and only if for every word it; £ S*, 
if A is started in either state with w as input, it either accepts in both cases or 
rejects in both cases. The language accepted by a DFA A is denoted by L{A). 
A language is accepted by many DFAs but there is only one essentially unique 
minimal DFA for the language which has the minimum number of states. 

A non- deterministic finite automaton (NFA) is also denoted by a 5-tuple 
B = (Q, S, S, s, F), where Q, S, s, and F are defined the same way as in a DFA 
and (5 : Q X E — >■ 2'3 maps a pair consisting of a state and an input symbol 
into a set of states rather than a single state. An NFA may have multiple 
initial states, in which case an NFA is denoted {Qj'E, 6, S, F) where S is the 
set of initial states. A language L is accepted by an NFA if and only if L is 
accepted by a DFA, and such a language is called a regular language. Two finite 
automata are said to be equivalent if they accepts the same regular language. An 
NFA can always be transformed into an equivalent DFA by performing subset 
construction. The reader may refer to [12l [S^ for more details about regular 
languages and automata theory. 

The state complexity of a regular language L is the number of states of the 
minimal, complete DFA accepting L. The state complexity of a class of regular 
languages is the worst among the state complexities of all the languages in the 
class. The state complexity of an operation on regular languages is the state 
complexity of the resulting languages from the operation. For example, we say 
that the state complexity of union of an m-state DFA language and an n-state 
DFA language is mn. This implies that the largest number of states of all the 
minimal, complete DFAs that accept the union of an m-state DFA language 
and an n-state DFA language, is mn, and such languages exist. Thus, state 
complexity is a worst-case complexity. 

3 State complexity of LI U L2 

We first consider the state complexity of U L2, where Li and L2 are regular 
languages accepted by m-state and n-state DFAs, respectively. It has been 
proved that the state complexity of is |2'" and the state complexity of 
Li U L2 is mn [T51 [23] ■ The mathematical composition of them is |2™ • n. In 
the following, we show that this upper bound can be lower. 

Theorem 1. For any m-state DFA M ~ (Qa[,'E,,6m,sm,Fj^i) and n-state 
DFA N = {Qn,^,Sn,sn,Fn) such that \Fm - {sai}\ = fc > 1, to > 2, n > 1, 
there exists a DFA of at most (2™^^ + 2"'^'^^^) • n — n + 1 states that accepts 
L{M)*UL{N). 

Proof. Let M — {Q ]\f ,1^, 6m , sm , Fm) be a complete DFA of to states. 
Denote \Fm ~ {sm}\ by F^. Then Fo = fc > 1 Let TV = (Qat, S, ^at, sjv, Fjv) 
be another complete DFA of n states. Let DFA M' — (Qm': 6m' ^ sa/'i Fw) 
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where 

sm' ^ Qm is a new start state, 
Qm' - {.SM'} U {P I P C (Qm -F^)kP^%] 
U{R I R C Qm k sm e R k RCiFaj^ 0}, 
^M'(sM',a) = {(^A/CsMia) for any a G S}, 

dM'{R, a) = {(5m (P, a)} for RCQm and a £ S if (5m a) n Fq = 0, 
6M'{R,a) — {(5m(P, a)} U {sa/} otherwise, 
Pa/' = {sM'} li{R \ RCQm k Rn Fm ^ 0}. 

It is clear that M' accepts L{M)* . In the second term of the union for Qa/' 
there are 2'""'^ — 1 states. And in the third term, there are (2*^ ~ 1)2™^'^^^ 
states. So M' has 2™"^ + 2"^~^~^ states in total Now we construct another 
DFA A = (Q, E, (5, s, P) where 

S = (sa/', Sat), 

Q = {{i,j) I i e Qa/' - {sM'},j e Qn} u {s}, 
'5((«,j),a) = ((5A./'(i,a),(5Ar(j,a)), G Q, a £ S, 
P = {(i,j) Me Pa/' orjeFjv}. 

We can see that 

L{A) = L{M') U L(iV) = L(Af)* U L{N). 

Note {sM',j) ^ Q, for j G Q^v — {sw}, because there is no transition going into 
Sa/' in DFA M'. So there are at least n — 1 states in Q are not reachable. Thus, 
the number of states of minimal DFA accepting L{M)* U L{N) is no more than 

IQI = (2™-i + 2™-'^-!) • + 1. □ 

If Sm is the only final state of M [k = 0), then L{M)* = L{M). 

Corollary 1. For any m-state DFA M = (Qm ^m, S]\i, Fm) and n-state 
DFA N — {Qn,T,,Sn, sn, Fpf), m > 1, n > 0, there exists a DFA A of at most 
12™ -n-n + l states such that L{A) = L{M)* U L{N). 

Proof. Let k be defined as in the above proof. There are two cases in the 
following. 

(I) k = 0. In this case, L{M)* = L{M). Then A simply needs at most m • n 
states, which is less than |2™ • n — n + 1 when m > 1. 

(II) k>l. The claim is clearly true by Theorem [TJ □ 

Next, we show that the upper bound |2™ ■ n — n + 1 is reachable. 

Theorem 2. Given two integers m > 2, n > 2, there exists a DFA M of m 
states and a DFA N of n states such that any DFA accepting L{M)* U L{N) 
needs at least |2™ • n — n + 1 states. 
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Proof. Let M = (Qm, ^m, 0, {to - 1}) be a DFA, where Qm = 
{0, 1, . . . , TO — 1}, — {a, &, c} and the transitions of M are 

Suih a) = i + \ mod to, i = 0, 1, . . . , to — 1, 

5m(0, b) = 0, (5A/(i, 6) = i + 1 mod to, i = 1, . . . , to — 1, 

i5m(«, c) = i, i = 0, 1, . . . , to — 1. 

The transition diagram of M is shown in Figure [1] 
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Figure 1: The transition diagram of the witness DFA M of Theorems [2] and [5] 

Let N — {Qn, Sn, 0, {n — 1}) be another DFA, where Qat = {0, 1, . . . , n - 1} 
and 

(5jv(«,a) = i, * = 0, l,...,n — 1, 
SN{i,b) — i, i — 0,1, . . . ,n — 1, 
5jv(i, c) = i + 1 mod n, i — 0,1, . . . ,n — 1. 

The transition diagram of N is shown in Figure [2] 

It has been proved in [24] that the minimal DFA accepting the star of an 
m-state DFA language has j2"^ states in the worst case. M is a modification 
of worst case example given in |24| by adding a c-loop to every state. So we 
design a |2™-state, minimal DFA M' = {Qm' ,^,^M' , S]\f' , Fm') that accepts 
L{M)*, where 

sm' ^ Qm is a new start state, 

Qm' = {sM'} U {P I P C {0, 1, . . . , TO - 2} & P 7^ 0} 

u{i? I P c {0, 1,...,TO - 1} & e P & TO - 1 e i?}, 

5M'{sM',a) = {5m(0, a) for any a e S}, 

6m'{R, a) = {^Af (P, a)} for R^Qm and aGSifTO — 1^ Sm{R, a), 
(5a/' (i?, a) = {5A/(P,a)} U {0} otherwise, 

Fm' = {SA/'}U{P I i? C {0,1,...,TO - 1} & TO - 1 G i?}. 
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Figure 2: The transition diagram of the witness DFA of Theorems [5] and [5] 

Then we construct a DFA A = (Q, E, S, s, F) accepting L{M)*UL{N) exactly 
as described in the proof of Theorem [1] where 

S = {SM',0), 

Q = {{ij) I i e Qm' - {sM'}J G Qn} U {s}, 
^{{■i,j),a) = {6M'{i,a),SN{j,a)), {i,j) eQ, aeJ:, 
F^{{i,j) \ i€FM' orj^n-l}. 

Now we need to show that A is a minimal DFA. 

(I) All the states in Q are reachable. 

For an arbitrary state (i, j) in Q, there always exists a string wiW2 such 
that S{{s\i,0), 1111102) = ihj), where 

Sm'{sm',wi) =i,wie {a, 6}*, 
5Ar(0, W2) = j, W2 e {c}*. 

(II) Any two different states and {12,32) in Q are distinguishable. 



1. ii 12, 32 ^ n — 1. We can find a string lui such that 

^((i2, j2),wi) i F, 

where wi £ {a,b}*, Sm' (11,^1) G F/i// and 5m(*2,wi) ^ 

2. ii ^ 12, 32 = n — I. There exists a string wi such that 

5{{ii,ji),wic) e F, 
S{{i2,j2),wic) i F, 

where w\ S {a, 6}*, 5m' f^i) G Fm' and 5Af'(i2,wi) ^ ^Af/ 
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3. h— 12 ^ Fm', ji 7^ J2- For this case, a string c" ^ can distinguish 
the two states, since (5((«i,ji), c""^"^!) £ F and (5((«2, J2), c""^"-'^) ^ 
F. 

4. ii = 12 e Fm', ji 7^ 32- A string 5™c"^^^-'^ can distinguish them, 
because (5((ii, ji), fe"c"-i-J'i) e F and (5((i2, ^2), fo'"c"-i-Ji) ^ F. 

Since ah the states in A are reachable and distinguishable, DFA A is minimal. 
Thus, any DFA accepting L{M)* U L{N) needs at least |2™ • n - n + 1 states. 
□ 

This result gives a lower bound for the state complexity of L(M)* U L{N). 
It coincides with the upper bound in Corollary [TJ So we have the following 
Theorem [3l 

Theorem 3. For any integer m > 2, n > 2, |2™ ■ n — n + 1 states are both 
sufficient and necessary in the worst case for a DFA to accept L{M)* U L{N), 
where M is an m-state DFA and N is an n-state DFA. 

4 State complexity of L(M)* n L{N) 

Since the state complexity of intersection on regular languages is the same as 
that of union [53] , the mathematical composition of the state complexities of star 
and intersection is also |2™. In this section, we show that the state complexity 
of L(M)* n L{N) is |2" • n - n + 1 which is the same as the state complexity 
ofL{M)*\jL{N). 

Theorem 4. For any m-state DFA M = (Qm,'^,Sm,sm,Fm) and n-state 
DFA N — {Qn, S, Sn, sn, F^) such that \Fm — = k > 1, m > I, n > 0, 

there exists a DFA of at most [2™^^ + 2™^'^^^) • n — n + 1 states that accepts 
L{M)*nL{N). 

Proof. We construct a DFA A accepting L{M)* n L{N) the same as in 
the proof of Theorem [1] except that its set of final states is 

F ^{{i,j)\i^FM-.3 ^Fn]. 

Thus, after reducing the n — 1 unreachable states {sM',j) ^ Q, for j £ Qn — 
{sn}, the number of states of A is sill no more than (2™^^+2™^''"^^)-n— □ 
Similarly to the proof of Corollary [U we consider both the case that M has 
no other final state except sm {L{M)* = L{M)) and the case that M has some 
other final states (Theorem|4]). Then we obtain the following corollary. Detailed 
proof may be omitted. 

Corollary 2. For any m-state DFA M = (Qm ^m, S]\i, Fm) and n-state 
DFA N = {Qn, Sn, sn, Fn), m > 1, n > 0, there exists a DFA A of at most 
|2™ -n-n + l states such that L{A) = L{M)* D L{N). 

Next, we show that this general upper bound of state complexity of L{M)* n 
L{N) can be reached by some witness DFAs. 



7 



Theorem 5. Given two integers m > 2, n > 2, there exists a DFA M of m 
states and a DFA N of n states such that any DFA accepting L[M)* n L(N) 
needs at least |2™ ■ n — n + 1 states. 

Proof. We use the same DFAs M and N as in the proof of Theorem [2j 
Their transition diagrams are shown in Figure [1] and Figure [21 respectively. 
Construct DFA M' = {Qm' Sm' , sm' , Fm') that accepts L{M)* in the same 
way. 

Then we construct a DFA A = {Q, S, S, s, F) accepting L{M)*nL{N) exactly 
as described in the proof of Theorem [2] except that 

F = {{i,n-l) I i e Fm,}. 

Now we prove that A is minimal. 

(I) Every state of A is reachable. 

Let be an arbitrary state of A. Then there always exists a string 

wiW2 such that S{{sm' ,0),wiW2) = where 

5m' {sm' ,wi) ^ i, wi e {a, &}*, 
6n{0,W2) = j, W2 e {c}*. 

(II) Any two different states and («2,j2) oi A are distinguishable. 

1. ii ^ i2- 

We can find a string wi such that 

S{{i,,n),w^c--'-^^)eF, 

5{{i2,32),w,c^'-^-niF, 
where Wi e {a, &}*, (5M'(ii,Wi) e Fm' and (5m'(*2,Wi) ^ Fm' ■ 

2. ^1=^2^ Fm-, ji ^ .h- 

There exists a string W2 such that 

S{{i,,n),w2c--'-^^)eF, 

m2.32).W2C--^-=-)iF, 

where wi S {a, 6}* and 5M'{ii,W2) G F^i . 

3. zi = Z2 e Fa//, ji ^ j2. 

di{ii,n),c^'-'~^^)eF, 
m2.32),c^'-^~mF 

Due to (I) and (II), A is a minimal DFA with |2™ • n — n + 1 states which 
accepts L(M)* nL(7V). □ 

This lower bound coincides with the upper bound in Corollary [51 Thus, the 
bounds are tight. 

Theorem 6. For any integer m > 2, n > 2, |2™ ■ n — n + 1 states are both 
sufficient and necessary in the worst case for a DFA to accept L{M)* H L{N), 
where M is an m-state DFA and N is an n-state DFA. 
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5 State complexity of Lf U L2 

In this section, we study the state complexity of U where Li and L2 are 
regular languages. It has been proved that the state complexity of Lf is 2™ 
and the state complexity of Li U L2 is mn [HI [23]. Thus, the mathematical 
composition of them is 2™ • n. In this section we will prove that this upper 
bound of state complexity of U L2 can not be reached in any case. We will 
first try to lower the upper bound in the following. 

Theorem 7. Let Li and L2 be two regular language accepted by an m-state and 
n-state DFAs, respectively. Then there exists a DFA of at most 2™ ■ n — n + 1 
states that accepts Lf U L2 ■ 

Proof. Let M ~ {Qm,^,Sm,sm,Fm) be a complete DFA of m states 
and Li = L{M). Let N = {Qn,J^,6n,sn,Fn) be another complete DFA of 
n states and L2 = L{N). Let M' = (Q m 5m' , Fm , {^m}) be an NFA with 
multiple initial states. Sm' (p, a) = g if (5m ((Z, a) ~ P where a £ S and p,q ^ Qm- 
Clearly, L{M') = L{M)^ — L^. After performing subset construction, we can 
get a 2^-state DFA A — {Qa,'^, 5a, sa, Fa) that is equivalent to M'. Since 
A has 2™ states, one of its final state must be Qm- Now we construct a DFA 
B = {Qb,^,Sb,sb,Fb), where 

Qb = I i G Qa, j e Qn}, 

SB = {SA, Sn), 

Fb = eQB\ieFAor j e Fn}, 

SBi{i,j),a) («',/), if SAii,a) = i' and (5^0', a) ^ j' , a £ S. 

It is easy to see that 5b{{Qmt3)to) G Fb for any j & Qn and a £ E. This 
means all the states (two-tuples) starting with Qi are equivalent. There are n 
such states in total. Thus, the minimal DFA accepting if U L2 has no more 
than 2™ • n — n + 1 states. □ 

This result gives an upper bound of state complexity of U L2. Now let's 
see if this bound is reachable. 

Theorem 8. Given two integers m > 2, n > 2, there exists a DFA M of m 
states and a DFA N of n states such that any DFA accepting L[M)^ U L{N) 
needs at least 2™ • n — n + 1 states. 

Proof. Let A/ = (Qa/, E, (5m, 0, {0}) be a DFA, where Qm = {0, l,...,m- 

1}, S — {a, b, c, d} and the transitions are 

(5m(0, a) ~ m — 1, Suii, a) = i — 1, « = 1, . . . , m — 1, 
(5m(0, b) = 1, Spiii, 6) = i, i = 1, . . . , m - 1, 
^m(0, c) = 1, (5m(1,c) = 0, SmU^c) = j, j = 2, . . . ,m - 1, 
5M{k, (i) = fc, fc = 0, . . . , m — 1. 

The transition diagram of M is shown in Figure[3l Let N = {Qn, 5n, 0, {0}) 
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Figure 3: The transition diagram of the witness DFA M of Theorems [8] and [TT] 

be another DFA, where Qn = {0, 1, . . . , n — 1}, S = {a, b, c, d} and the transi- 
tions are 

6N{i,a) — h i = 0,...,n — 1, 
(5jv(*, &) = i, i = 0, . . . , n — 1, 
6Nii,c) = i, i = 0, . . . , n — 1, 
5jv(i, d) = i + 1 mod n, i = 0, . . . , n — 1. 

The transition diagram of N is shown in Figure SI 

a , b , c 




Figure 4: The transition diagram of the witness DFA N of Theorems [8l and [TT] 

Note that M is a modification of worst case example given in [2^ for reversal, 
by adding a c?-loop to every state. Intuitively, the minimal DFA accepting 
L{M)^- should also have 2™ states. Before using this result, we will prove it 
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first. Let A = {Qa, S, Sa, {0}, Fa) be a DFA, where 

<3a = I g C Qm}, 

Ti = {a, b, c, d}, 

5a{p, e) = {j I Suii, e) =j,ie p}, p € Qa, e € S, 
i^A = {g I {0} eq,qe Qa}. 

Clearly, A has 2™ states and it accepts L{M)^. Now let's prove it is minimal. 

(i) Every state i G Qa is reachable. 

1. i = 0. 

\i\ = if and only if i = 0. Sa{M, b)=i = 9. 

2. |i| = l. 

Assume that i = {p}, < p < m — 1. ^^({0}, a^) = i- 

3. 2 < \i\ < m. 

Assume that i = {11,^2, ■ • • ,jfe}, < «i < i2 < . . . < i/c < m — 1, 
2 < k < m. 6a{{0},w) = i, where 

w = a6(ac)'^-*'=-i-ia6(ac)''=-i-''=-=-^ • • • ab{acy^-'^-'^a'K 

(ii) Any two different states i and j in Qa are distinguishable. 

Without loss of generality, we may assume that |i| > Let x G i — j- 
Then a string a"*~^ can distinguish these two states because 

SA{i,a"'--) e Fa, 
SA{j,a"'-=') i Fa. 

Thus, >1 is a minimal DFA with 2™ states which accepts L{M)^. Now let 
B = {Qb, S, Sb, {({0}, 0)}, Fb) be a DFA, where 

Qb = {{p,q) \ peQA- {Qm}, « e Qiv} U {{Qm,0)}, 
S = {a, b, c, d}, 

Fb = {(j>,q)\p€FAOvqG Fjv, (p, q) G Qb}, 



and for {p, q) e Qb, , e e S 
&B{{p,q),e) = 



{p',q') iiSA{p,e) =p', i5A,(g,e) = q', p' ^ Qm, 
{Qm ,0) if 6A{p,e) = Qm. 

As we mentioned in last proof, all the states (two-tuples) starting with Qm 
are equivalent. Thus, wc replace them with one state: {Qm, 0). It is easy to see 
that B accepts the language L{M)^ U L{N). It has 2™ • n - n + 1 states. Now 
lets see if B is a minimal DFA. 
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(I) All the states in Qb are reachable. 

For an arbitrary state {p,q) in Qb, there always exists a string d'^w such 
that SB{{{0},0),d'^w) = {p,q), where w G {a,b,c}* and Sa{{0},w) = p. 

(II) Any two different states (pi,qi) and {p2,q2) in are distinguishable. 

1- qi = 92- 

We can easily find a string d^w such that 

SBi(pi,qi),d'w) G Fb, 
SB{(p2,q2),d'w) ^ Fb, 

where i+qi mod n O.w E {a, 6, c}*, (5a (pi, w) G Fa and i5a(p2, w) ^ 
Fa. 

A string d'^~'^'^w can distinguish these two states where w G {a, &, c}* 
and 5a{pi,w) ^ Fa, because 

'5B((Pi,gi),d"-^^w;) G Fb, 
SB{{P2,q2),d"-'"w) i Fb. 

3. pi ^ P2, qi # q2- 

We first find a string u; G {a, 6, c}* such that Sa{pi,'w) G Fa and 
Sa{P2,w) ^ Fa- Then it is clear that 

<5B((pi,gi),d"-*iu;) G Fb, 
5B{{P2.q2),d^-'''w) i Fb. 

Since all the states in B are reachable and distinguishable, DFA B is minimal. 
Thus, any DFA accepting L{M)^' U L{N) needs at least 2™ - n- n + l states. 
□ 

This result gives a lower bound for the state complexity of L{M)^ U L{N). 
It coincides with the upper bound. So we have the following Theorem [S] 

Theorem 9. For any integer m > 2, n > 2, 2™ • n — n + 1 states are both 
sufficient and necessary in the worst case for a DFA to accept L{M)^ UL{N), 
where M is an m-state DFA and N is an n-state DFA. 

6 State complexity of Lf fl L2 

The mathematical composition of the state complexities of reversal and inter- 
section is also 2™ • n, since the state complexities of intersection and union are 
the same )24 . In this section, we will show that the state complexity of Lf D L2 
is also 2™ • n — n + 1, which is the same as that of Ff U i2. We will start with 
an upper bound less than the mathematical composition. 
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Theorem 10. Let Li and L2 be two regular language accepted by an m-state 
and n-state DFAs, respectively. Then there exists a DFA of at most 2'"-ri — n + 1 
states that accepts n L2 ■ 

Proof. Let M — {Qm,^,Sm,sm,Fm) be a complete DFA of m states 
and Li = L{M). Let N = (Qtv, Jjv, sat, i^jv) be another complete DFA of 
n states and L2 = L{N). Let M' = {Qm,'^,Sm',FmAsm}) be an NFA with 
multiple initial states. Sm' (p, a) = g if (5m (9, a) = P where a S S and p,q ^ Qai- 
Clearly, L{M') = L{M)^ = Lf-. After performing subset construction, we can 
get a 2'"-state DFA A ~ {Qa, Sa, sa, Fa) that is equivalent to M'. Since A 
has 2™ states, one of its nonfinal state must be a sink state, denoted by tA- 
Now we construct a DFA B — {Qb, S, 6b, s_b, Fb), where 

Qb = {(«, I « e Qa, j e Qat}, 

SB = {sa,sn), 

Fb = {{i,j) eQB\ieFA,j ^Fn}, 

SBi{i,j),a) = if (5^(i,a) = i' and (5Ar(j, a) = j', a e S. 

We can see that SBiitAjj),^) ^ Fb for any j G Qn and a £ E, since tA is 
the sink state of DFA A which accepts L{M)^ . This means all the states (two- 
tuples) starting with tA are equivalent. There are n such states in total. Thus, 
after reducing them to one state, we can see the number of states of A is sill no 
more than 2'" ■ n — n + 1. □ 

Theorem [TUl gives an upper bound of state complexity of Lf n L2. Now let's 
see if this bound is reachable. 

Theorem 11. Given two integers m > 2, n > 2, there exists a DFA M of m 
states and a DFA N of n states such that any DFA accepting L{M)^' n L(iV) 
needs at least 2™ ■ n — n + 1 states. 

Proof. We use the same DFAs M and N as in the proof of Theorem [51 
Their transition diagrams are shown in Figure [3] and Figure |4l respectively. It 
has been shown in the proof of Theorem [8] that the minimal DFA accepting 
L(M)^ has 2™ states. So we design a minimal DFA A = {Qa, S, 5a, {0}, Fa) 
that accepts L{M)^ in the same way, where 



Qa = {9 I 9 C Qm}, 

S — {a, 6, c, d}, 

5A{p,e) = {j I (5A/(i,e) = j, i e p}, p e Qa, e e S, 

Fa = {g I {0} e g, g e Qa}. 

. Note that A must have a sink state, denoted by tA. 

Next we construct a DFA B = {Qb,^,Sb, ({0},0),Fb) accepting L{M)^ n 
L{N) , where 



Qb 



Fb 



{{p,q) \ P e Qa - {tA}, q e Qn} ^ {{tA,0)}, 
{a,b, c, d}, 

{{p, q) \ P & Fa, q & Fn, ip, q) e Qb}, 
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and for {p,q) E Qb, ,e E 

As we mentioned in last proof, all the states starting with Ia are equivalent. 
Thus, we replace them with one sink state: {tA^Q)■ Clearly, B accepts the 
language L{M)^ n L{N) and it has 2™ ■ n — n + 1 states. Next we prove that 
i? is a minimal DFA. 

(1) Every state of B is reachable from ({0}, 0). 

Let {p, q) be an arbitrary state of B. Then there always exist a string d'^w 
such that Sb{{{0}, 0),d'^w) — {p, q), where w E {a, b, c}* and (5yi({0}, w) = 
P- 

II Any two different states {pi,qi) and (^2,92) of B are distinguishable. 

1- qi = 92- 

In this case, we can find a string d^w such that 

SBi{pi,qi),d'w) E Fb, 
5B{{p2,q2),d'w) i Fb, 

where i+gi mod n = 0, G {a, 6, c}*, (5a (Pi, w) E Fa and Sa{p2i w) ^ 
Fa- 

2- Pi =P2, qi 92- 

A string d"'~'^^w can distinguish states {pi,qi) and (^2,92), where 
w E {a, 6, c}* and 5a{jPitw) € Fa, because 

fe((pi,(?i),d"-«iw) E Fb, 
fe((P2,g2),rf"-«^H i Fb. 

3- Pi ^P2, qi ^ 92- 

Since A is a minimal DFA and pi ^ P2, there always exists a string 
w E {a,b,c}* such that dA{pi,w) E Fa and Sa{p2,w) ^ F^. Then it 
is clear that 

dB{{Pi,qi),d''-'''w) E Fb, 
SB{{P2,q2),d''-'''w) i Fb. 

Now we know DFA B is minimal because all the states in B are reachable 
and distinguishable. Thus, any DFA accepting L{M)^ D L{N) needs at least 
2™ • n - n + 1 states. □ 

Theorem [TT] gives a lower bound of state complexity of L{M)^' n L{N). It 
coincides with the upper bound shown in Theorem llOl So we have the following 
theorem. 

Theorem 12. For any integer m > 2, n > 2, 2™-n — n + 1 states are both 
sufficient and necessary in the worst case for a DFA to accept L{M)^nL{N), 
where M is an m-state DFA and N is an n-state DFA. 
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7 Conclusion 



In this paper, we have studied the state complexities of union and intersection 
combinc^d with star and reversal. We have proved the state complexities of 
four particular combined operations, including: L{M)* UL(7V), L{M)* C\L{N), 
L{M)^UL{N) and L{M)^r\L{N) for m > 2 and n > 2. They are less than the 
mathematical composition of state complexities of its component operations. 
The state complexities of the four combined operations are also less than the 
state complexities of the combined operations composed of the same individual 
operations h\it in different orders. The reason of this is that the state com- 
plexities decrease when we perform union and intersection in the end instead of 
star or reversal. This makes the order of the state complexities reduced from 
0(2™+") to 0{2"^n). An interesting question is: why are the state complexities 
of L{M)* U L{N) and L{M)* f) L{N) the same whereas the state complexities 
of (L(M) U L{N))* and (L(M) n L{N))* are different? 
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